Mirex 2012 Submission Audio Classification Using Sparse Feature Learning
نویسندگان
چکیده
We present a training/test framework for automatic audio annotation and ranking using learned feature representations. Commonly used audio features in audio classification, such as MFCC and chroma, have been developed based on acoustic knowledge. As an alternative, there is increasing interest in learning features from data using unsupervised learning algorithms. In this work, we apply sparse Restricted Boltzmann Machine to audio data, particularly focusing on learning high-dimensional sparse feature representation. Our evaluation results on two music genre datasets show that the learned feature representations achieve high accuracy.
منابع مشابه
Mirex 2011: Automatic Audio Tag Classification via Sparse Coding
This extended abstract details our submission to the Music Information Retrieval Evaluation eXchange (MIREX) 2011 for the audio tag classification task. First of all, we extract a fixed-length feature vector (composed of some timbral as well as modulation spectrum features) from each song clip. Then, by using l-reconstruction to represent each test song clip as a linear combination of all train...
متن کاملMirex 2012 Submission Audio Classification Using High-dimensional Representations Learned on Standard Audio Features
We present a training/test framework for audio classification using learned feature representations. In contentbased music information retrieval tasks, standard audio features such as MFCC and chroma are typically used to represent the music content. As an alternative, there is increasing interest in learning feature representations from data using unsupervised learning algorithms. In the previ...
متن کاملMirex 2011: Music Geren Classification via Sparse Representation
This extended abstract details our submission to the Music Information Retrieval Evaluation eXchange (MIREX) 2011 for the audio training\test task. First of all, we extract a fixed-length feature vector (composed of some timbral as well as modulation spectrum features) from each training clip. Then, by representing a fixed-length feature vector (extracted from a test clip) as a linear combinati...
متن کاملMIREX 2010 Audio Onset Detection
This paper presents an approach for the Audio Onset Detection task [1], which is submitted to MIREX 2010. In MIREX 2009, we presented our approach that utilizes information on the general characteristics of the notes for onset categorization, as well as integrates energy-based and pitch-based detection results. In MIREX 2010, we extend our submission to MIREX 2009 by parameters fine-tuning and ...
متن کاملMirex 2012: Mood Classification Tasks Submission
In this work, three audio frameworks – Marsyas, MIR Toolbox and PsySound3, were used to extract audio features from the audio samples. These features are then used to train several classification models, resulting in the different versions submitted to MIREX 2012 mood classification task.
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012